Multichannel speech enhancement using Bayesian spectral amplitude estimation

نویسندگان

  • Thomas Lotter
  • Christian Benien
  • Peter Vary
چکیده

This paper introduces two shon-time spectral amplitude estimators for speech enhancement with multiple microphones. Based on joint Gaussian models of speech and noise Fourier coefficients the clean speech amplitudes are estimated with respect to the MMSE or the MAP criterion. The estimators outperform single microphone minimum mean square amplitude estimators when the speech is highly correlated and the noise is sufficiently uncorrelated. Whereas the first MMSE estimator also requires the desired signals to be in phase, the second MAP estimator performs a direction-independent noise reduction. The estimators are generalizations of the well known single channel MMSE estimator derived by Ephraim and Malah and the MAP estimator derived by Wolfe and Godsill respectively.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Speech Enhancement Using Gaussian Mixture Models, Explicit Bayesian Estimation and Wiener Filtering

Gaussian Mixture Models (GMMs) of power spectral densities of speech and noise are used with explicit Bayesian estimations in Wiener filtering of noisy speech. No assumption is made on the nature or stationarity of the noise. No voice activity detection (VAD) or any other means is employed to estimate the input SNR. The GMM mean vectors are used to form sets of over-determined system of equatio...

متن کامل

Distributed multichannel speech enhancement with minimum mean-square error short-time spectral amplitude, log-spectral amplitude, and spectral phase estimation

In this paper, the authors present optimal multichannel frequency domain estimators for minimum mean-square error (MMSE) short-time spectral amplitude (STSA), log-spectral amplitude (LSA), and spectral phase estimation in a widely distributed microphone configuration. The estimators utilize Rayleigh and Gaussian statistical models for the speech prior and noise likelihood with a diffuse noise f...

متن کامل

Beta-order minimum mean-square error multichannel spectral amplitude estimation for speech enhancement

In this paper, the minimum mean-square error (MMSE) ˇ-order estimator for multichannel speech enhancement is proposed. The estimator is an extension of the single-channel MMSE ˇ-order and multichannel MMSE short-time spectral amplitude estimators using Rayleigh and Gaussian distributions for the statistical models under the assumption of a diffuse noise field where the noise is estimated indepe...

متن کامل

Distributed multichannel speech enhancement based on perceptually-motivated Bayesian estimators of the spectral amplitude

In this study, the authors propose multichannel weighted Euclidean (WE) and weighted cosh (WCOSH) cost function estimators for speech enhancement in the distributed microphone scenario. The goal of the work is to illustrate the advantages of utilising additional microphones and modified cost functions for improving signal-to-noise ratio (SNR) and segmental SNR (SSNR) along with log-likelihood r...

متن کامل

Efficient β-order Perceptually Motivated Spectral Amplitude Bayesian Estimator Based On Chi-distribution for Speech Enhancement

The traditional Bayesian estimator of short-time spectral amplitude is based on the minimization of the squared-error cost function under the common Gaussian probability density function (pdf). The Gaussian distribution, however, is not the optimal probability distribution. To overcome this phenomenon, we considered to replace the traditional distribution hypothesis of spectral amplitude of spe...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003